Smart Paper Notes

Improving the Predictive Performances of $k$ Nearest Neighbors Learning by Efficient Variable Selection

Eddie Pei , Ernest Fokoue

Categories: Machine Learning (Statistics) | Machine Learning

2022-11-04

This paper computationally demonstrates a sharp improvement in predictive performance for $k$ nearest neighbors thanks to an efficient forward selection of the predictor variables. We show on both simulated and real-world data that this novel approach repeatedly outperforms regression models under stepwise selection.
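
The abstract does not spell out the selection procedure, so the following is only a minimal sketch of greedy forward selection wrapped around a $k$ nearest neighbors regressor. The function names, the leave-one-out squared-error criterion, and the stopping rule are all assumptions made for illustration, not the authors' implementation.

```python
import random

def knn_predict(train_X, train_y, x, k):
    """Predict the mean response of the k nearest training points (Euclidean)."""
    dists = sorted(
        (sum((a - b) ** 2 for a, b in zip(row, x)), y)
        for row, y in zip(train_X, train_y)
    )
    return sum(y for _, y in dists[:k]) / k

def loo_error(X, y, cols, k):
    """Leave-one-out mean squared error using only the columns in `cols`."""
    Z = [[row[j] for j in cols] for row in X]
    total = 0.0
    for i in range(len(Z)):
        rest_X = Z[:i] + Z[i + 1:]
        rest_y = y[:i] + y[i + 1:]
        total += (knn_predict(rest_X, rest_y, Z[i], k) - y[i]) ** 2
    return total / len(Z)

def forward_select(X, y, k=3):
    """Greedily add the variable that most reduces the error; stop when none helps."""
    selected, best = [], float("inf")
    remaining = list(range(len(X[0])))
    while remaining:
        scores = {j: loo_error(X, y, selected + [j], k) for j in remaining}
        j_best = min(scores, key=scores.get)
        if scores[j_best] >= best:
            break  # assumed stopping rule: no candidate improves the error
        selected.append(j_best)
        remaining.remove(j_best)
        best = scores[j_best]
    return selected, best

# Toy data (hypothetical): the response depends on column 0 only; columns 1 and 2 are noise.
rng = random.Random(0)
X = [[rng.uniform(-1, 1) for _ in range(3)] for _ in range(60)]
y = [2.0 * row[0] + rng.gauss(0, 0.05) for row in X]
selected, err = forward_select(X, y, k=3)
```

On this toy data the informative column is picked first, because distances computed on noise columns carry no information about the response.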


A Computational Exploration of Emerging Methods of Variable Importance Estimation

Louis Mozart Kamdem , Ernest Fokoue

Categories: Machine Learning (Statistics) | Machine Learning

2022-08-05

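Estimating the importance of variables is an essential task in modern machine learning. It helps to evaluate the merit of a feature in a given model. Several techniques for estimating variable importance have been developed over the last decade. In this paper, we present a computational and theoretical exploration of emerging methods of variable importance estimation, namely the Least Absolute Shrinkage and Selection Operator (LASSO), Support Vector Machine (SVM), the Predictive Error Function (PERF), Random Forest (RF), and Extreme Gradient Boosting (XGBOOST), tested on different kinds of real-life and simulated data. All these methods handle both regression and classification tasks seamlessly, but all fail on data containing missing values. The implementation shows that PERF performs best on highly correlated data, closely followed by RF. PERF and XGBOOST are "data-hungry" methods: they perform worst on small data sizes, but they are the fastest in execution time. SVM is the most appropriate when the dataset contains many redundant features. An additional advantage of PERF is its natural cut-off at zero, which separates positive from negative scores: positive scores indicate essential and significant features, while negative scores indicate useless ones. RF and LASSO are very versatile: although they do not give the best results, they can be used in almost all situations.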


Hierarchical multimodal transformers for Multi-Page DocVQA

Rubèn Tito , Dimosthenis Karatzas , Ernest Valveny

Categories: Computer Vision | Artificial Intelligence | Natural Language Processing

2022-12-07

Document Visual Question Answering (DocVQA) refers to the task of answering questions from document images. Existing work on DocVQA only considers single-page documents. However, in real scenarios documents are mostly composed of multiple pages that should be processed altogether. In this work we extend DocVQA to the multi-page scenario. For that, we first create a new dataset, MP-DocVQA, where questions are posed over multi-page documents instead of single pages. Second, we propose a new hierarchical method, Hi-VT5, based on the T5 architecture, that overcomes the limitations of current methods to process long multi-page documents. The proposed method is based on a hierarchical transformer architecture where the encoder summarizes the most relevant information of every page and then, the decoder takes this summarized information to generate the final answer. Through extensive experimentation, we demonstrate that our method is able, in a single stage, to answer the questions and provide the page that contains the relevant information to find the answer, which can be used as a kind of explainability measure.


A Hyperspectral and RGB Dataset for Building Facade Segmentation

Nariman Habili , Ernest Kwan , Weihao Li , Christfried Webers , Jeremy Oorloff , Mohammad Ali Armin , Lars Petersson

Categories: Computer Vision

2022-12-06

Hyperspectral Imaging (HSI) provides detailed spectral information and has been utilised in many real-world applications. This work introduces an HSI dataset of building facades in a light industry environment with the aim of classifying different building materials in a scene. The dataset is called the Light Industrial Building HSI (LIB-HSI) dataset. This dataset consists of nine categories and 44 classes. In this study, we investigated deep learning based semantic segmentation algorithms on RGB and hyperspectral images to classify various building materials, such as timber, brick and concrete.


Portmanteauing Features for Scene Text Recognition

Yew Lee Tan , Ernest Yu Kai Chew , Adams Wai-Kin Kong , Jung-Jae Kim , Joo Hwee Lim

Categories: Computer Vision

2022-11-09

Scene text images have different shapes and are subjected to various distortions, e.g. perspective distortions. To handle these challenges, the state-of-the-art methods rely on a rectification network, which is connected to the text recognition network. They form a linear pipeline which uses text rectification on all input images, even for images that can be recognized without it. Undoubtedly, the rectification network improves the overall text recognition performance. However, in some cases, the rectification network generates unnecessary distortions on images, resulting in incorrect predictions in images that would have otherwise been correct without it. In order to alleviate the unnecessary distortions, the portmanteauing of features is proposed. The portmanteau feature, inspired by the portmanteau word, is a feature containing information from both the original text image and the rectified image. To generate the portmanteau feature, a non-linear input pipeline with a block matrix initialization is presented. In this work, the transformer is chosen as the recognition network due to its utilization of attention and inherent parallelism, which can effectively handle the portmanteau feature. The proposed method is examined on 6 benchmarks and compared with 13 state-of-the-art methods. The experimental results show that the proposed method outperforms the state-of-the-art methods on several of the benchmarks.


Adaptive Bias Correction for Improved Subseasonal Forecasting

Soukayna Mouatadid , Paulo Orenstein , Genevieve Flaspohler , Judah Cohen , Miruna Oprescu , Ernest Fraenkel , Lester Mackey

Categories: Machine Learning | Machine Learning (Statistics)

2022-09-21

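Subseasonal forecasting, that is, predicting temperature and precipitation 2 to 6 weeks ahead, is critical for effective water allocation, wildfire management, and drought and flood mitigation. Recent international research efforts have advanced the subseasonal capabilities of operational dynamical models, yet temperature and precipitation prediction skill remains poor, partly due to stubborn errors in representing atmospheric dynamics and physics inside dynamical models. To counter these errors, we introduce an adaptive bias correction (ABC) method that combines state-of-the-art dynamical forecasts with observations using machine learning. When applied to the leading subseasonal model from the European Centre for Medium-Range Weather Forecasts (ECMWF), ABC improves temperature forecasting skill by 60-90% and precipitation forecasting skill by 40-69% in the contiguous U.S. We couple these performance improvements with a practical workflow, based on Cohort Shapley, for explaining ABC skill gains and identifying higher-skill windows of opportunity based on specific climate conditions.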
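
The abstract does not describe how ABC combines forecasts with observations. To illustrate only the general idea of correcting a dynamical forecast using recent observations, here is a much simpler stand-in: a running-mean bias correction. The function name, the window length, and the numbers are hypothetical, and this is not the ABC method itself.

```python
def bias_corrected(forecasts, observations, new_forecast, window=3):
    """Subtract the mean error of the most recent `window` forecasts."""
    # Error of each past forecast against what was actually observed.
    errors = [f - o for f, o in zip(forecasts, observations)][-window:]
    bias = sum(errors) / len(errors)
    return new_forecast - bias

# Hypothetical past temperature forecasts and matching observations.
past_forecasts = [21.0, 19.5, 22.0, 20.5]
past_observed = [20.0, 18.5, 21.0, 19.0]
corrected = bias_corrected(past_forecasts, past_observed, new_forecast=23.0)
```

Here the model has recently run warm, so the corrected value is lower than the raw forecast of 23.0.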


Neuro-symbolic Models for Interpretable Time Series Classification using Temporal Logic Description

Ruixuan Yan , Tengfei Ma , Achille Fokoue , Maria Chang , Agung Julius

Categories: Machine Learning | Artificial Intelligence

2022-09-15

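Most existing time series classification (TSC) models lack interpretability and are difficult to inspect. Interpretable machine learning models can help discover patterns in data and give domain specialists easy-to-understand insights. In this study, we present Neuro-Symbolic Time Series Classification (NSTSC), a neuro-symbolic model that leverages signal temporal logic (STL) and neural networks (NN) to accomplish TSC tasks using a multi-view data representation, and that expresses the model as a human-readable, interpretable formula. In NSTSC, each neuron is linked to a symbolic expression, i.e., an STL (sub)formula. The output of NSTSC is thus interpretable as an STL formula akin to natural language, describing temporal and logical relations hidden in the data. We propose an NSTSC-based classifier that adopts a decision-tree approach to learn formula structures and accomplish multiclass TSC tasks. The proposed smooth activation functions for wSTL allow the model to be learned in an end-to-end fashion. We test NSTSC on a real-world wound healing dataset from mice and on benchmark datasets from the UCR time-series repository, demonstrating that NSTSC achieves performance comparable to state-of-the-art models. Furthermore, NSTSC can generate interpretable formulas that match domain knowledge.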
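
To make the idea of an STL formula over a time series concrete, the sketch below evaluates the standard quantitative (robustness) semantics of the "always" and "eventually" operators for a simple threshold predicate. It uses hard min and max rather than the smooth activation functions the abstract mentions, and the function names and signal values are assumptions chosen for illustration.

```python
def rob_gt(signal, c):
    """Robustness of the atomic predicate x > c at each time step."""
    return [x - c for x in signal]

def always(rho, start, end):
    """G[start,end]: the predicate must hold at every step, so take the minimum."""
    return min(rho[start:end + 1])

def eventually(rho, start, end):
    """F[start,end]: the predicate must hold at some step, so take the maximum."""
    return max(rho[start:end + 1])

# Hypothetical signal; a positive robustness means the formula is satisfied.
signal = [0.2, 0.5, 0.9, 1.4, 1.1]
rho = rob_gt(signal, 1.0)
g = always(rho, 0, 4)      # negative: x > 1.0 fails at some step
f = eventually(rho, 0, 4)  # positive: x > 1.0 holds at some step
```

The sign of the result gives the truth value and its magnitude gives the margin, which is what makes such formulas usable as differentiable-style scores once min and max are replaced by smooth approximations.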


Expressive Reasoning Graph Store: A Unified Framework for Managing RDF and Property Graph Databases

Sumit Neelam , Udit Sharma , Sumit Bhatia , Hima Karanam , Ankita Likhyani , Ibrahim Abdelaziz , Achille Fokoue , L. V. Subramaniam

Categories: Artificial Intelligence

2022-09-13

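Resource Description Framework (RDF) and Property Graph (PG) are the two most commonly used data models for representing, storing, and querying graph data. We present Expressive Reasoning Graph Store (ERGS), a graph store built on top of JanusGraph (a property graph store) that also allows storing and querying RDF datasets. First, we describe how RDF data can be translated into a property graph representation, and then describe a query translation module that converts SPARQL queries into a series of Gremlin traversals. The converters and translators thus developed allow any Apache TinkerPop compliant graph database to store and query RDF datasets. We demonstrate the effectiveness of the proposed approach using JanusGraph as the base property graph store and compare its performance with standard RDF systems.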
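
The abstract does not give the mapping rules, so the following is a minimal sketch of one common way to turn RDF triples into a property graph: triples whose object is a literal become vertex properties, and triples whose object is a resource become labeled edges. The tuple layout, the `is_literal` flag, and the example IRIs are assumptions for illustration, not the ERGS conversion itself.

```python
def rdf_to_property_graph(triples):
    """Map (subject, predicate, object, is_literal) tuples to vertices and edges."""
    vertices = {}  # IRI -> dict of properties
    edges = []     # (source IRI, label, target IRI)
    for s, p, o, is_literal in triples:
        props = vertices.setdefault(s, {})
        if is_literal:
            # RDF predicates may be multi-valued, so keep a list per property.
            props.setdefault(p, []).append(o)
        else:
            vertices.setdefault(o, {})
            edges.append((s, p, o))
    return vertices, edges

# Hypothetical input triples.
triples = [
    ("ex:alice", "foaf:name", "Alice", True),
    ("ex:alice", "foaf:knows", "ex:bob", False),
    ("ex:bob", "foaf:name", "Bob", True),
]
vertices, edges = rdf_to_property_graph(triples)
```

Under such a mapping, a SPARQL pattern like `?x foaf:knows ?y` corresponds to following `foaf:knows` edges, which is the kind of step a Gremlin traversal expresses with `out('foaf:knows')`.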


Blind Users Accessing Their Training Images in Teachable Object Recognizers

Jonggi Hong , Jaina Gandhi , Ernest Essuah Mensah , Ebrima H Jarjue , Kyungjun Lee , Hernisa Kacorri

Categories: Computer Vision

2022-08-16

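Iterating between training and evaluating a machine learning model is an important process for improving its performance. However, while teachable interfaces enable blind users to train and test an object recognizer with photos taken in their distinctive environment, the accessibility of the training iteration and evaluation steps has received little attention. Iteration assumes visual inspection of the training photos, which is inaccessible to blind users. We explore this challenge through MyCam, a mobile app that incorporates automatically estimated descriptors for non-visual access to the photos in users' training sets. We explore how blind participants (N=12) interact with MyCam and the descriptors through an evaluation study in their homes. We demonstrate that the real-time photo-level descriptors enabled blind users to reduce the number of photos with cropped objects, and that participants could add more variation by iterating through and assessing the quality of their training sets. In addition, participants found the app simple to use, indicating that they could effectively train it and that the descriptors were useful. However, these subjective responses were not reflected in the performance of their models, partly due to little variation in the training photos and cluttered backgrounds.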


Limits of an AI program for solving college math problems

Ernest Davis

Categories: Artificial Intelligence

2022-08-14

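Drori et al. (2022) report that "A neural network solves, explains, and generates university math problems by program synthesis and few-shot learning at human level ... [It] automatically answers 81% of university-level mathematics problems." The system they describe is indeed impressive; however, the above description is very much overstated. The work of solving the problems is done not by a neural network but by the symbolic algebra package Sympy. Problems in various formats are excluded from consideration. The so-called "explanations" are just rewordings of lines of code. Answers are marked as correct even when they are not in the form specified in the problem. Most seriously, it seems that in many cases the system uses the correct answer given in the test corpus to guide its path to solving the problem.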
